Diigo Launches – More Than Just Bookmarking

By ippisl
  • tags: no_tag

    • es, gather sources, and easily publish a post to your blog, Diigo may be your solution. Diigo allows you to add multiple blogs to your account, verify them, and easily publish a post, however you may only publish and cannot manage old entries. What I like is that while you browse the web and you come across a site talking about a specific topic you want to expand on, you can right click and select, “Blog This,” which will then direct you to the blogging area where you can write your post along with that site being your source. The other method is by simply going to your bookmarks section and selecting a bookmark, or multiple bookm
  • tags: test

    • Data mining
    • Data mining is the process of extracting hidden patterns from large amounts of data
    • The term data mining has also been used in a related but negative sense, to mean the deliberate searching for apparent but not necessarily representative patterns in large amounts of data. To avoid confusion with the other sense, the terms data dredging and data snooping are often used. Note, however, that dredging and snooping can be (and sometimes are) used as exploratory tools when developing and clarifying hypotheses.
  • tags: no_tag

    • The following companies can scrape websites for you into desired output formats.
    • The following companies can scrape websites for you into desired output formats.
    • The following companies can scrape websites for you into desired output formats.
  • tags: no_tag

    • Developer’s Guide: Protocol and Java
    • Developer’s Guide: Protocol and Java
    • The Google Notebook Data API allows client applications to view public notebook content in the form of Google Data API feeds. Your client application can request a list of notes from a public notebook, request a list of public notebooks owned by a particular user, and query for notes that match particular criteria.

      Google Notebook feeds are currently read-only and public-only; you can’t send data to Notebook using the Data API, and you can’t get a feed of private Notebook data

  • tags: database, tomography, literature, based, discovery

    • DEVELOPMENT OF QUERIES FOR INFORMATION RETRIEVAL
    • The key problem with most standard search approaches is that the analyst is required to hypothesize the search terms in the context of the application, rather than use the database to provide the search terms appropriate to the context in which they are actually imbedded
    • For general language databases, automated text retrieval using linguistic rules and supporting on-line dictionaries could provide marginally acceptable results for some classes of users. 
    • For highly technical S&T databases, the focus of the text retrieval techniques discussed in the present Appendix, automated text retrieval results in poor retrieval performance. 
    • Natural language processors have severe limitations when applied to highly specialized technical terms.
    • yield the search terms from the language and context of the authors
    • Typical R&D literature surveys have none of these three quality conditions. 
    • When applied to the literature in a technical field, co-word analysis allows a map of the relationship among technical themes to be constructed. 
    • Term Co-occurrence in information retrieval can be used to expand on an initial query,
    • These additional terms could also be used to remove irrelevant documents. 
    • Query Expansion provided the greatest improvement in performance when the original query gave reasonable retrieval results
    • The main idea consists of choosing important terms, or expressions, attached to certain previously retrieved documents that have been identified as relevant by the users, and of enhancing the importance of these terms in a new query formulation
    • Relevance Feedback
    • Performance with feedback improved substantially over the no feedback case. 
    • Results show that terms selected from particular database fields of retrieved items during term relevance feedback (TRF) were more effective than search terms from the intermediary, database thesauri or users’ domain knowledge during the interaction.
    • Classical co-word analysis applied to index/ key words for the purpose of science and technology (S&T) evaluation does not allow the richness of the semantic relationships in full text to be exploited, and it is restricted to formally published papers
    • In order to allow any form of free text to be used, Database Tomography (DT) was developed. 
    • The first step is identification of the main themes of the text being analyzed. The second step is determination of the quantitative and qualitative relationships among the main themes and their secondary themes.
    • The highest frequency technical content phrases are selected by topical experts as the pervasive themes of the full database.
    • Second, for each theme phrase, the frequencies of phrases within some domain centered about the theme phrase are computed for every occurrence of the theme phrase in the full text, and a phrase frequency dictionary is constructed.
    • This phrase frequency dictionary contains the phrases closely related to the theme phrase.
    • Both quantitative and qualitative analyses are performed by the topical expert for each dictionary (hereafter called cluster) yielding, among many results, those sub-themes closely related to and supportive of the main cluster theme. 
    • Third, threshold values are assigned
    • these indices are used to filter out the phrases most closely related to the cluster theme
    • the qualitative analyses of the extracted data by the topical experts have been at least as important as the quantitative analyses.
    • a variety of different analyses can be performed.
    • the final results have been identification of the pervasive technical themes of the database, the relationship among these themes, and the relationship of supporting sub-thrust areas (both high and low frequency)
    • Expert-centric S&T text mining provides an in-depth understanding/ identification of the technical concepts and their inter-relationships, whereas the computer-centric approach focused on the more superficial level of context-free phrases. 
    • Simulated Nucleation
    • a small core group of documents mainly relevant to the topic of interest is identified
    • An inherent assumption is then made that the bibliometric and phrase patterns and phrase combinations characteristic of this relevant core group would be found to occur in other relevant documents.  Therefore, these bibliometric and phrase patterns and phrase combinations can be used to expand the search query,
    • using computer-based clustering techniques for separating the relevant from non-relevant records (e.g., see Hearst (1996) and Zamir (1999) for examples of clustering approaches to separate relevant from non-relevant documents)
    • Kostoff et al, 2005a
    • IV-A.  Overview of Updated Process 

      The operational objective of Simulated Nucleation is to generate a query that will have the following characteristics: 

      *Retrieve the maximum number of records in the technical discipline of interest

      *Retrieve substantial numbers of records in closely allied disciplines

      *Retrieve substantial numbers of records in disparate disciplines that have some connection to the technical discipline of interest

      *Retrieve records in aggregate with high signal-to-noise ratio (number of desirable records large compared to number of undesirable records)

      *Retrieve records with high marginal utility (each additional query term will retrieve large ratio of desirable to undesirable records)

      *Minimize query size to conform to limit requirements of search engine(s) used 

    • To achieve these objectives, the Simulated Nucleation process has been improved and updated, and now contains the following steps: 

      *Definition of study scope

      *Generation of query development strategy

      *Generation of test query

      *Retrieve records from database; select sample

      *Divide sample records into relevant and non-relevant categories

      *Perform computational linguistics on each category

      *Use new algorithms to identify phrases unique to each category

      *Use new algorithms to identify phrase combinations unique to each category

      *Use new algorithms to identify marginal value of adding each phrase and phrase combination to query

      *Construct modified query

      *Repeat process until convergence obtained 

      Each of these steps will now be described in more detail, and the upgrades and improvements will be emphasized. 

    • in general, a separate query had to be developed for each database examined
    • Relation of Query Structure to Database Fields Selected 
    • Relation of Query Structure to Expert(s) Perspectives
    • there is no evidence that a rigorous query of high quality and utility (comparable to those developed using Simulated Nucleation and the semi-structured textual SCI and EC databases) could be made of the highly unstructured Web as it exists now and in the foreseeable future.
      • better for science and tech databases – post by ippisl
    • Generic Term Initialization 
    •   While the emphasis of these later iterations was reduction of non-relevant records
    • Specific Term Initialization 
    • Because of the specificity of the query terms, records relating to the more general theme and scope of the study may be overlooked.
    • having to make a binary decision (on the relevance or non-relevance of a retrieved record) sharpens the focus of the study measurably. 
    • DT has focused on two types of congruency metric patterns for identifying candidate query modification terms: phrase frequencies and phrase proximity statistics
    • and then the phrases in close proximity to selected theme phrases in the Abstracts were also obtained with the DT algorithms.
  • tags: no_tag

    • Diigo Toolbar is now installed on your browser
    • Diigo Toolbar is now installed on your browser
    • Diigo Toolbar is now installed on your browser
  • tags: no_tag

    • what you wan
    • you want
  • tags: no_tag

    • ing our text version of this document.May 24, 2002 The Data W
  • tags: book, marketing, stickiness

    • With an entertaining blend of case studies and startling research, the Heath brothers lay out the critical elements of a sticky idea. They are–

      1. Simplicity
      2. Unexpectedness
      3. Concreteness
      4. Credibility
      5. Emotions
      6. Stories

  • tags: polling, sms, everywhere, startup, service

    • Step1 Ask your audience a question Step2 They answer using SMS text messages or the web (try voting!)   Step3 Results update live in your web browser or PowerPoint
    • Poll Everywhere
    • creative places to use Poll Everywhere:
  • tags: ebooks, kindle, amazon, market, pricing

    • Amazon.com on Monday introduced the Kindle 2
    • The announcement strengthens the bid by Amazon for control of the e-book market
    • most significantly, Amazon said it would start selling e-books that can be read on mobile phones and other devices
    • several incremental improvements
    • The Kindle 2 has
    • new feature, Whispersync, which would allow readers to begin a book on one Kindle and continue, at the same point in the text, on another Kindle or a mobile phone.
    • Google said last week that it would soon sell books from its publishing partners for reading on mobile devices
    • Addressing Google’s initiatives, Mr. Bezos said in an interview that Amazon knows what book buyers want and stressed the company’s digital catalog of 230,000 newer books and best sellers.
    • Apple poses another potential threat to Amazon’s plans.
    • But some worry that Amazon may be assuming too much control over pricing
    • Amazon generally charges $9.99 for the digital versions of best sellers
    • That means that for now, Amazon is taking a loss or making a small margin on the sale of some e-books.
    • Mr. Bezos disagreed. “E-books should be cheaper than physical books.
    • “The thought that there might be one very dominant player who could squeeze most of the profits out of this new market is frightening for authors and publishers,” Mr. Aiken said.
    • publishers have remained vigilant in fostering competition in the e-book market.
  • tags: no_tag

    • s interesting. Will give it a try. I find bookmarking sites an easy affair these days with both Opera and Firefox doing a great job of syncing all the bookmarks online so they are always available no matter what.
  • tags: no_tag

  • tags: no_tag

    • Elements of Scientific Research and Discovery >

       

      ELEMENTS OF SCIENTIFIC RESEARCH AND DISCOVERY: >
      A Study with CER in High-Temperature Superconductivity >

      Sakir Kocabas >

      Dept. of Space Engineering, Istanbul Technical University, >
      Maslak 80626, Istanbul, Turkey. >

      Abstract >

      In this paper we describe a program, CER, which models some of the research activities carried out in the process of the discovery of high-temperature superconductors in 1986 and 1987. These activities include goal and strategy choosing, literature searches, proposing experiments, expectation setting, designing and conducting experiments, data collection, generating and testing hypotheses, modifying hypotheses, and generating explanations. >

    • Elements of Scientific Research and Discovery

       

      ELEMENTS OF SCIENTIFIC RESEARCH AND DISCOVERY:
      A Study with CER in High-Temperature Superconductivity

      Sakir Kocabas

      Dept. of Space Engineering, Istanbul Technical University,
      Maslak 80626, Istanbul, Turkey.

      Abstract

      In this paper we describe a program, CER, which models some of the research activities carried out in the process of the discovery of high-temperature superconductors in 1986 and 1987. These activities include goal and strategy choosing, literature searches, proposing experiments, expectation setting, designing and conducting experiments, data collection, generating and testing hypotheses, modifying hypotheses, and generating explanations.

      CER’s design includes many of the elements of scientific research and discovery and provides a step toward a complete computational model. The system has 17 discovery operators which use over 150 methodological rules many of which are general and applicable to other domains of physics and chemistry.

      Keywords: Scientific discovery, autonomous operators, methodological rules, consistency, completeness, hypothesis generation, scientific explanation.

  • tags: no_tag

    • A YEAR) eyeglasses after finding your site.
  • tags: no_tag

  • tags: no_tag

    • to MIT. As they start doing research with their professors, as many MIT undergraduates do, they learn another healthy lesson, namely, a professor may well behave like a fumbling idiot.

        The drive for excellence and

  • tags: no_tag

    • human body
    • human body
  • tags: no_tag

    • ffering the opportunity for
  • tags: no_tag

    • Nurse-led general practice: the changing face of general practice?

Posted from Diigo. The rest of my favorite links are here.

Leave a Reply